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Abstract 

Gossip algorithms are widely used to solve the distributed consensus problem, but issues can arise when nodes receive multiple 
signals either at the same time or before they are able to finish processing their current work load. Specifically, a node may assume a 
new state that represents a linear combination of all received signals; even if such a state makes no sense in the problem domain. As a 
solution to this problem, we introduce the notion of conflict resolution for gossip algorithms and prove that their application leads to 
a valid consensus state when the underlying communication network possesses certain properties. We also introduce a methodology 
based on absorbing Markov chains for analyzing gossip algorithms that make use of these conflict resolution algorithms. This 
technique allows us to calculate both the probabilities of converging to a specific consensus state and the time that such convergence 
is expected to take. Finally, we make use of simulation to validate our methodology and explore the temporal behavior of gossip 
algorithms as the size of the network, the number of states per node, and the network density increase. 
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I. Introduction 

The distributed consensus problem asks how every node in a network can adopt the same state value for a given state variable 
when there is no centralized coordination mechanism [ ]-[ ]. Applications of this problem frequently arise in the self-organization, 
cooperation, and coordination of complex and multi-agent systems. Some of the more commonly studied areas include resource 
location, formation flight of UAVs, attitude alignment of clusters of satellites, self-organization, automated highway systems, 
congestion control in communication networks, load balancing, rendezvous in space, distributed sensor fusion, belief propagation, 
convention emergence, and task allocation [ ]-[ ]. The distributed consensus problem is also similar to the study of opinion 
dynamics and the spread of social norms in computational sociology [8], [9]. 

In this paper, we propose a solution to the distributed consensus problem that makes use of gossip algorithms that allow 
for nodes to receive multiple simultaneous transmissions, either through parallel synchronized clocks or random chance. This 
approach is in contrast to the existing research on gossip algorithms in which it is assumed that each node receives only a single 
incoming transmission [ ], [ ]. 

Our solution builds on the representation of gossip algorithms as linear systems [10] where the state update equation is defined 
as x(f + 1) = W(r)x(f); where x(f) is the state vector of the nodes at time t and W is a random matrix that determines how 
each node updates its state at time t. Wij = 1 if node j transmits to node i, and otherwise. Under this definition, the reception 
of multiple simultaneous transmissions can result in undesired values of x(f + 1) by allowing nodes to take on a value that is 
a linear combination of the transmitting nodes. In order to avoid this undesired behavior, we redefine the state update equation 
as x(f + 1) = /(W(f ))x(r); where /(W(f)) is a binary relation between W(f) and a set of row stochastic matrices composed of 
{0,1} entries. For simplicity, we call / the conflict resolution algorithm. 

The motivation for our research stems from two fronts. First, gossip algorithms are widely used in solving the distributed 
consensus problem. As computing power increases and computing components become cheaper, it will become easier and easier 
to build very large decentralized systems. As the size of these systems increases, so too will the complexity of coordinating 
them. It is therefore important to ensure that they are robust to errors or other unforeseen events such as the reception of multiple 
simultaneous signals by a single autonomous component. 

Second, the theoretical investigation of gossip algorithms is primarily centered on asynchronous and synchronous timing 
models, both of which guarantee that only two nodes are ever active at once. This ensures that no node ever receives more than 
a single transmission. This assumption simplifies analysis but, as we show, leaves open the door for interesting and unexpected 
behaviors upon violation. Such violations may occur as a result of misaligned clocks that allow multiple nodes to fire at once, or 
as a result of nodes receiving incoming transmissions faster than they are able to process them. If a node processes information 
slower than it receives it, then the node must either ignore the incoming information, or store it in a queue. If a queue is used, and 
only one transmission is handled at a time, then there is a risk of queue overflow. If, however, multiple elements are processed 
from the queue, or if it is emptied and processed as a set of multiple simultaneous transmissions, special treatment will be 
required to resolve conflicting information. 
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Our solution to the distributed consensus problem is designed to handle conflicts that occur as a result of multiple simultaneous 
transmission by specifying a conflict resolution algorithm for each node. We will show that our solution is guaranteed to produce 
a consensus when certain assumptions hold and will describe one simple conflict resolution algorithm based on the random 
selection of an incoming state value. Furthermore, we will show that it is possible to predict the expected consensus state as 
well as the time required to reach that state. Finally, we will provide empirical data from computer simulations to validate our 
theoretical claims and then use our theory to explore how various network characteristics impact the temporal behavior of our 
solution. 

A note on notation: we indicate matrices and vectors with bold upper and lowercase symbols, e.g. M for matrices and v 
vectors. Individual elements will be indexed, non-bold, lowercase symbols, e.g. for matrices and v, for vectors. 

II. Related Work 

Given a network G — {V,E) in which every node possesses a state variable, x, a gossip algorithm is a method of decentralized 
information exchange in which one node u €V transmits the contents of its state variable, x„, to another node, y e V, where 
V is selected in accordance to some gossip mechanism. Upon reception of node m's transmission, node v updates its own state 
variable, x,,, according to some gossip protocol [ 1], [ii]. Much of the recent work on gossip algorithms make use of a gossip 
mechanism in which node v is selected uniformly at random from the local neighborhood of node m [],[], [ ], [ ], but gossip 
mechanisms do exist where node v is selected from the entire population [14]-[22] or node u transmits to all local neighbors via 
flooding [ ] or broadcasting [""]. Transmission can also be bidirectional [ 1 0], [II], ['' '] or unidirectional [ ]-[ ]. In all cases, 
the frequency of transmission is controlled by an internal clock that ticks according to either an asynchronous timing model or 
synchronous timing model. 

Under an asynchronous timing model, every node in the network has a clock which ticks according to a Poisson process with 
a rate of X. This is equivalent to a single clock that ticks according to a Poisson process with a rate of nk, where n= \V\ is the 
number of nodes in the network [ ]. In practice, this means that at each tick an average of nX nodes are chosen independently 
and uniformly at random to transmit their state values. 

Under a synchronous timing model, every node in the network has a clock that ticks at the same frequency. This results 
in all nodes transmitting their state values at exactly the same time. Because transmission occurs in parallel, any information 
received during a tick cannot be propagated further until the following tick. If it is required that transmission be pairwise disjoint, 
precautions must be taken to ensure that nodes are not the targets of multiple transmissions when a synchronous timing model 
is used. 

In the existing analysis of gossip algorithms under both synchronous and asynchronous timing models, the gossip constraint 
is often observed. The gossip constraint is responsible for the assumption a node will never receive multiple simultaneous 
transmissions because with a probability of at least 1 — 1/n [ ] only two nodes will ever be in communication at the same time 
in the case of bidirectional transmission; and in the case of unidirectional transmission, only one node will transmit at a time. 
For the synchronous timing model, this implies that nodes must be matched so that their transmissions form disjoint pairs. This 
constraint greatly simplifies analysis and implementation of gossip algorithms in real world systems [ ], but as we will show 
through the creation of gossip protocols that handle conflicting information at the node level, it is not strictly required in order to 
obtain a consensus. Furthermore, by allowing conflict to occur, it becomes much more natural to implement synchronous timing 
models, since matching is not required, and improbable conflict events under asynchronous timing models require no special 
treatment or attention. 

When information is spread through gossip in such a way that the gossip constraint is violated, we will use the term 
unconstrained gossip. We will call gossip algorithms that are designed to handle the reception of multiple simultaneous signals 
unconstrained gossip algorithms. 

III. Problem Specification 

Consider a directed graph, G = {V,E) defined by a set of n nodes, V, and a set of edges, E — {{u,v) : u,v E V}, such that 
node u points to node v. The neighbors of node u are given by N{u) — {v : {u,v) £ E A u v}. Next, assume that time can be 
broken in to discrete intervals, where f denotes the current interval. Let each node, u £V, possess a clock that ticks or 1 times 
per interval such that the node acts only when the clock ticks, and a state variable x„ G S whose value is time-dependent and in 
the set of valid state values, S. When the clock of node u ticks, the value of is transmitted along a single outgoing edge to a 
single neighbor, v e N{u). Node v updates x,, upon reception of x,, according to the state update equation Xy — h{xu). 

If h{-) is linear, then the state update equation Xy = h{xi,) can be vectorized to allow the simultaneous transmission of state 
values. In this vectorized form, transmission at time f can be written as the linear system 

x(f + l) =W(f)x(f) (1) 

where x(f) e M"^' is the column vector of node states at time f, and W(f) G M"^" is the transmission matrix that specifies the 
source and target nodes of the transmission activity at time t. W is an independent and identically distributed random matrix 
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Fig. 1. The directed grapli G = ({1, 2, 3,4}, {(1, 2), (1, 3), (3,4)}). 



whose value is determined as 



{ 



1 if node j transmits to node / 
otherwise 



UnHke previous research on gossip-based distributed consensus algorithms ["*], [22], [23], [27], the transmission matrix we 
examine is not guaranteed to be row stochastic. This is a direct result of not enforcing the gossip constraint and allowing multiple 
simultaneous transmissions within the network. As a consequence, nodes may take on undefined state values, and consensus 
may be impossible to reach. In the rest of this paper we discuss how this consequence can be mitigated and describe a set of 
algorithms for unrestricted gossiping that can lead to the formation of a consensus despite the positive probabihty that nodes 
will receive multiple simultaneous transmissions. 



Given a communication network where nodes can receive multiple simultaneous transmissions, it is possible to reach a 
consensus state if there exists an algorithm, /, that transforms the transmission matrix, W, into a row stochastic matrix, A. It is 
sufficient for / to produce a row stochastic matrix because it has been previously proven that if a matrix A is row stochastic, 
then x(f + l) = A(f )x(f ) will converge to a consensus as f — >■ oo [10], [25], [27]. Furthermore, if A is row stochastic, then the 
consensus state is a fixed point of the system and as such will not change unless acted upon by an external influence. 

Let / : M"^" — > M"^" be an algorithm that transforms W into A, then equation 1 can be rewritten as 



The adoption matrix denotes which transmitting nodes will actually be used to update the states of the receiving nodes. 

A. Stability of the consensus state 

The consensus state of (2) is the vector x^, such that x,- —Xj for all Xi^Xj E x^. Thus x^ = kl, where k E S. 

Lemma 1: x^. is a fixed point of x(f + 1) = A(f)x(f). 

Proof: By construction, A(f) =/(W(f)) is row stochastic, so A(f)l = 1. Thus, 1 is an eigenvector of A(f) with an eigenvalue 
of A = 1. Because scalar multiples of eigenvectors are also eigenvectors, x^ = k\ is an eigenvector of A(f) with an eigenvalue 
X — I. So A(r)Xf = Xc, and thus the consensus state, x^, is a fixed point of x(f + 1) = A(f)x(f). ■ 

Thus, by lemma 1, if the system reaches a consensus it will remain there until acted upon by external forces. 

B. Proof of convergence 

Based on the idea of a consensus graph from Schmalz [25], [27], let a consensus sequence, Ac be a sequence of adoption 
matrices, {A(fi),A(f2)...A(fT)} with f i < f2 < • • • < such that x^ = A^itj) ■ • • A(f2)A(fi)x(fi). 

Lemma 2: If G has a directed spanning tree and G is not a directed ring network, then a consensus sequence, Ac, exists for 
the system associated with the communication network, G. 

Proof: If G has a directed spanning tree with a root (O eV , then a sequence of matrices can be constructed that pass down 
the state variable of (0 to each child, and then from each child to each grandchild, and so on down the tree until all nodes 
possess the state value of CO. If, however, G is a ring network then the last node would have no choice but to transmit its state 
value to the parent node. This action would produce an infinite loop. Thus G must not be a directed ring network. ■ 



'If W = I - "''f then W represents the average consensus algorithm. If W = \ + ei(ej — e,)^ then W represents a directed gossip algorithm. 



IV. Problem Solution 



We call A(f) 




(2) 




1 if node / adopts the state of node j 
otherwise 
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Example: Consider the graph G = {V,E) in figure 1 where 

2,3,4} 
£ = {(1,2),(1,3),(3,4)} 
^(0) = [1,2,3,4]^ 



Because this graph contains a directed spanning tree and is not a directed ring network there is a consensus sequence, 
Ac = {A(fi),A(f2),A(f3)}, such that 



A(?i) 



A(f2) 



A(f3) 



"1 








0" 


1 
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Theorem 3: If G has a directed spanning tree, is not a directed ring network, and A(f) —f{W{t)) is row stochastic, then the 
probabiHty of constructing a consensus sequence, Ac, through random selection of transmission matrices tends to 1 as the length 
of time tends to infinity. 

Proof: In accordance with previous work on directed gossip algorithms by Schmalz [25], [27], Let At be a finite time 
interval. Let p be the probability that a consensus sequence, Ac, occurs within Af. Because G has a directed spanning tree, we 
can set Af = \Ac\, and so clearly p > 0. Let pc be the event where a consensus sequence, Ac, occurs within a time interval 
r = /-Af. Then, 

limr^^P{pc) + limr^„P{l - pc) = I 
limr^aoP{l — Pc) = limr^a!,{l — pY 
linii-^ooil — pY — since p >0 
limr^ooP{pc) ~ 1 

■ 

Corollary 4: If the communication network, G, associated with x(f + l) = /(W(f))x(f) has a directed spanning tree and is 
not a ring network, then the system is guaranteed to converge to a consensus, x^. 

Proof: By direct apphcation of lemma 2 and theorem 3. ■ 

C. An example conflict resolution algorithm: Proportional Selection 

Having shown that the formation of a stable consensus is not only possible, but guaranteed, we now introduce proportional 
selection as an example of a conflict resolution algorithm. Proportional selection takes as input the transmission matrix, W, 
where = 1 if node j transmits to node / and otherwise. It produces as output the row-stochastic adoption matrix A, such 
that aij = 1 if node / adopts the state of node j and otherwise, with the additional constraint that Al = 1. 

Under proportional selection, the probability of adoption for each state value in the set of incoming transmissions is equal 
to the ratio of the number of times that state value occurs to the number of all received state values. For example, if a node 
receives the values {2,2,3} then there is a 2/3 chance of that node adopting the value 2 and a 1/3 chance of the node adopting 
the value 3. 

The proportional selection algorithm consists of two main steps. First initialize the adoption matrix A = 0. Next, for each row, 
/, in W, randomly select a single column, j, with a positive entry and then set A,,- = L If a row is composed of all O's then set 
A„ = 1 to denote that the node keeps its current value. This technique guarantees that each row only has a single 1 and all other 
entries 0, thus ensuring A is row stochastic. 
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V. Prediction of the Consensus State and Expected Time to Consensus 

Given a distributed consensus algorithm with the state update equation x{t + l) ~ A{t)x{t) where A(f) = /(W(f)) is the 
adoption matrix at time t and x(f ) is the state vector at time t taken from a finite discrete set of states S, it is possible to compute 
the probability of achieving each consensus state through the construction of a Markov chain; however, to use a Markov chain 
we must first construct the associated state space, H, and transition matrix, M. 

A. Generating the Markov State Space 

For the state space of the Markov chain, H, we let each element be a vector [xq x\ ■ ■ ■ x„]^ where x,- G 5 is the state 
variable of the /th node in G. Under this construction, H is then equal to the set of all possible permutations of [xq x\ ■ ■ ■ jc„]^ 
as Xi varies over 5'. Given the Markov state space, H, we define z to be a row vector where < z, < 1 denotes the probabiUty 
that the initial distribution of node values in G is equal to the /th Markov state and = 1. 

For example, let 5 = {0,1}, let /!i e//= [0 1]^ and let /15 e H = [1 1]^. If z = [0 1 0] then 
jco = 0, xi = 0, and X2 — I with probability 1; however, if z = [0 0.5 0.5 0] then xi = 0, and X2 = 1 with 
probability 1 but xq = with probability 0.5 and xq = 1 with probability 0.5. This particular definition thus defines z as the 
starting distribution of the Markov chain. It is anticipated that for most practical applications, the initial distribution of state 
values will be deterministic, and thus z, = 1 for some /; however, if the initial state of each node in G is determined according 
to a uniform distribution, then z, = l/l^l for 

The primary application of z is to explore the distribution of node states in the network at time f through the solution of zM', 
where M is the Markov transition matrix. 

B. Generating the Markov Transition Matrix 

Given that each state in the Markov chain represents the aggregate state of all nodes in the network, G, the transitions between 
states of the Markov chain represent the change in the distribution of node states. The probabilities of each transition are used 
to generate the Markov transition matrix, M, where mij is the probability that the network will transition from state / to state j. 
The specific probabilities depend on not only the topology of the network, but also the conflict resolution algorithm being used. 

For the directed graph G = let G' be the adjacency matrix of G with G\j = 1 if node / points to node j and otherwise. 

The construction of M requires multiple stages and depends on G, G' and the conflict resolution algorithm. 

The first stage in construction of the transition matrix, M, is to generate the state space of the Markov chain, H. This is 
accomplished by enumerating all permutations of the state values that can be taken by the nodes in G. For example, on the 
graph K2 = ({1,2}, {(1,2), (2,1)}) with 5 = {0,1}, there are 4 possible Markov states: [0 0]^, [0 1]^, [1 0]^, and [1 1]^. 
Every Markov state is represented by a node in the corresponding Markov chain's graph representation. 

The second stage is to generate the set of valid transmission matrices, for G. For every transmission matrix T G J?', f,y = 1 
if node j transmits to node / and otherwise. Because all transmission matrices must account for the graph topology, it must 
be the case that f,y = if G'- — 0. Furthermore, — because it is forbidden for a node to transmit to itself. Finally, T must 
be column stochastic because we are only considering gossip protocols in which nodes transmit to a single neighbor 

The third stage is to construct the set of all possible adoption matrices, si/ . Once all valid transmission matrices are generated, 
it is guaranteed by construction that they are column stochastic, but not row stochastic. This results in a set of matrices which 
may represent multiple simultaneous receptions by one or more nodes. In order to resolve this phenomenon, the transmission 
matrices must be transformed into the row stochastic adoption matrices through the application of the chosen conflict resolution 
algorithm. However, because the goal is to construct the Markov transition matrix M, it is essential to construct every possible 
adoption matrix A G jz/ such that aij = 1 if node / adopts the state of node j and otherwise; note that in an adoption matrix 
an = 1 if node / does not adopt any other node. As a result of this procedure, there is a high probability that duplicate adoption 
matrices will be generated. These duplicates must be eliminated. 

The fourth stage is to use the set of adoption matrices, ^ and the Markov state space, H, to generate the edges of the Markov 
chain's graph representation. For each Markov state h G H, let h' be the set of states reachable from h. Then the /th transition 
from h is given by h!^ = s^Jfi where 1 < / < |^|. Thus the set of outgoing edges from each Markov state is given as {(/i,/zj), 
{h^h'2), • • • , (/!,/!'^|)} for every h & H. Once all edges have been determined, each of them is assigned a weight- of 1 /|i</|. At 
this point, the Markov chain is represented by a multi-edge digraph. To transform it in to a simple digraph, first sum the weights 
of duplicate edges^^ and assign that value to a single edge; then remove all of the duplicates. Once this process is complete, the 
Markov chain is represented as a simple digraph with weighted edges that determine the transition probability from one state to 
another. 



^This specific weight value is due to a uniform probability of transmission among nodes. 
'We can sum the weights because the probability of transmission independent. 
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The final stage is to specify M. Following the construction of the Markov chain as a simple weighted digraph, M is then 
specified as follows. Let v, and Vj be the /th and yth node in the Markov graph and w be the weight of the edge (v,-,v,), then 

f w if there is an edge from node v, to node Vj 
\0 otherwise 

Thus M is defined as a traditional state transmission matrix from Markov chain theory [28] and each entry niij represents the 
probability to transition from state / to state j. The exact probability of each entry depends on both the topology of the underlying 
communication graph and the conflict resolution algorithm. 



C. Consequences of Markov Representation 

By representing gossip over a network as a Markov chain, we are able to clearly explain and predict the number of times the 
nodes of the network will be in a particular configuration, the probability that a particular consensus will be reached, and the 
expected time that will be required to reach a consensus. 

Lemma 5: The Markov chain with state space H and transition matrix M is an absorbing Markov chain. 

Proof: We have shown, by theorem 3, that given a set of assumptions on G and the use of gossip algorithms described 
in this paper, a consensus will always be reached if given sufficient time. Furthermore, because the consensus states of G are 
fixed points, by lemma 1, the Markov chain described by M has one or more absorbing states. Thus, M is an absorbing Markov 
chain. ■ 

Because M is an absorbing Markov chain, it can be rewritten in canonical form [28], [29] as 



M': 



Q R 

I 



where Q is a submatrix that describes the probability to transition from one transient state to another and R is a submatrix 
that describes the probability to transition from a transient state to an absorbing state. This transformation is accomplished by 
reordering and swapping the rows and columns of M. 

Using M', we can quickly verify that M is absorbing by checking to ensure that Q' = as f oo [ ; ]. This works because 
if M' is absorbing, every transient state should eventually transition to an absorbing state as t tends to infinity. As this occurs, 
the probability to transition from one transient state to another approaches 0. 

As another consequence of M', the probability to transition from one network configuration, /, to another, j, after t steps is 
given by 

[Q' R + RQ + RQ2+ +RQ' 
I 



M" 



Furthermore, it is now possible to define lh& fundamental matrix [28], [29], N, as N = I + Q + Q^ H hQ'. If we allow t to 

approach infinity, then this can be rewritten as N = (I — Q)^'. 

The fundamental matrix is important because it allows us to compute the number of times the nodes of the network will be in 
a particular configuration, the probability that a particular consensus will be reached, and the expected time that will be required 
to reach a consensus. 

1) Calculating the Expected Time in Each State: 

Corollary 6: N represents the expected number of times the chain is in state j given that it starts at state 

Proof: A direct consequence of Lemma 5 [28], [29]. ■ 
2} Calculating the Distribution of Consensus States: 

Corollary 7: Let B = NR, then B,y is the probability to be absorbed by the yth absorption state given that the initial state of 
the system is the /th transient state. 

Proof: A direct consequence of Lemma 5 and Corollary 6 [28], [29]. ■ 
3} Calculation of the Expected Time to Consensus: 

Corollary 8: Let Ta = Nl, then T^i^ is the expected number of steps (or matrix multiplications) until an absorbing state is 
reached when the system starts in the /th transient state. 

Proof: A direct consequence of Lemma 5 and Corollary 6 [28], [29]. ■ 

4} Rough bounds on the Expected Time to Consensus: It is also useful to be able to calculate the upper and lower bounds of 
the expected time to consensus. 

Given the expected time to consensus, t^^, the variance of the number of steps is 



where is the column vector with ij^^ — iAiiAi [29]. Let X be a random variable representing the time to convergence, then by 
the Markov inequality P{X > a) < ^. If a = Oj then we can calculate the probability of a value being more than k standard 
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Fig. 2. The fully connected 3 node graph, ^'3. 




Fig. 3. A Markov chain for the state value distribution of K^,. 

deviations away. If a — iA + 5 and then we can calculate the probability that the expected consensus time is larger than some 
delta of itself. Finally, \f a then the Markov inequality tells us that P{X > y) — ^- Using this last value for a, we can 

see that 95% of the time consensus will be reached in less than IQIa steps. Obviously these are very broad bounds and require 
computation of the Markov matrix, M. We are currently investigating a method by which to bound the consensus time of the 
systems described in this paper without computing M. 

D. A Simple Example 

Having described how to solve for the absorption probabilities, we now provide a concrete example by considering the 
completely connected 3 node graph, ^^3, pictured in figure 2 with S ~ {0, 1} and proportional selection. 

1) Step 1: Generate the state space: There are j^j'^^l — g possible Markov states. For simplicity, let us order the Markov 
states by interpreting each one to be a binary number; thus ifz==[0 1 0]^ then the initial state values 
for each node are given as x\ — 0, X2 = 0, and xt, = 1 and we write the 2nd Markov state as [0 1]^. 

2) Step 2: Generate the Markov chain: Figure 3 represents the Markov chain corresponding to this example (figure 2) as 
a graph with each node representing one possible state configuration for the network, and each edge weight representing the 
probability of transitioning from one state to another, as determined by the direction of the edge. For this particular example it 
turns out that there are 1 1 possible adoption matrices, and so each unaggregated edge is transversed with a probability of 1/11 
due to the use of proportional selection. Removing duplicate edges to consolidate the graph produces the edge weights observed 
in figure 3. 
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3) Step 4: 
is given by 



Generate the transition matrix: The transition matrix, corresponding to the Markov chain represented in figure 3 



M: 



4) Step 5: 
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<~obabilities and expected absorption time: Placing M 
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we find that 



M' 



And so the expected number of steps spend in each state j from the initial state / is given by 



N: 



By corollary 7 the expected probability to reach a consensus on state j given the initial transient state / is given by 
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B=NR 
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0.3333' 
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0.6667 



By coroUary 8 the expected number of steps required to reach a consensus on any of the absorbing states given the initial 
transient state / is given by 



:N1 



5.5000 
5.5000 
5.5000 
5.5000 
5.5000 
5.5000 



So, based on these results we expect that regardless of the initial distribution of node states it will take on average 5.5 steps 
to reach a consensus; but the specific consensus reached will depending on the initial distribution of node states. 



VI. Theoretical Validation via Numerical Simulation 

It is important that any theoretical framework be validated against empirical, observed, or historical data. We choose to use a 
simple randomized numerical simulation for this task. We compare our theoretical predictions to empirical data for the consensus 
probabilities, B, and convergence time, t^, for a network with a single root node (figure 4a), the completely connected 4 node 
network, K4 (figure 4b), and the random network in figure 4c. For each network there two possible states per node and proportional 
selection is used as the conflict resolution algorithm. We examine the consensus behavior that results from initializing the network 
with every possible non-consensus network state. Each empirical data point is the mean over 1,000 replications. 

We will claim that our theory is valid if the theoretical predictions and empirical data are approximately equal, such that the 
empirical data for the consensus probabihty is within 5% of the theoretical value and the mean empirical consensus time for all 
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(c) 

Fig. 4. The following networks are used to validate our Markov-based analysis framework: (a) a rooted network, (b) the complete network on 4 nodes, and 
(c) a random network. 



initial states is statistically equal to the corresponding expected theoretical consensus time according to Student's T Test with 
a — 0.05. Due to the nature of pseudo-random numbers we do not expect our theoretical and empirical results to be exactly 
equal to each other 

To simplify notation, we will favor writing the states of the Markov chain as strings of digits, where the /th digit represents 
the value of the ith node, as opposed to using vector notation. 

A. Simulation Description 

We represent a system of gossiping agents as the linear dynamical system x(r + l) = A(f)x(f) where x is the state vector at 
time t and A is the adoption matrix at time f. The adoption matrix is determined by the conflict resolution algorithm currently 
being used by the system and the agent's communication network, G. 

At each time step the simulation executes the following operations. First, the transmission matrix, T, is generated as a uniform 
random matrix. This construction is done column by column such that there is only a single 1 in each column. Next, in preparation 
for conflict resolution, any row of T that consists entirely of zeros is replaced by the corresponding unit vector e such that e, = 1 
for the /th row and otherwise. Once T has been fully generated, it is used to create the value matrix, Tdiag{ii{t)). This value 
matrix is then used by the selected conflict resolution algorithm to generate the appropriate adoption matrix, A. Upon generation 
of A, the state vector is updated and time is incremented. The simulation comes to a halt when either a user-defined maximum 
time value is reached, or all values of the state vector x(f) are epsilon equal, as defined by the equation max{x{t)) — min{x{t)) < s. 

While the simulation is running we collect the state of every agent at each time step. We also record the halting time. If the 
vector of states at the final time step are all equal, then the corresponding value is the consensus state of the system. By running 
a simulation for multiple replications, we are able to calculate the mean halting time as well as the upper and lower bounds for 
the 95% confidence interval of the mean halting time. We are also able to use the count of each consensus state to determine the 
probability of converging to a specific consensus value when starting from a specific initial state. Likewise, we can approximate 
the convergence probability for a random initial state. 

B. Probability to Converge to a Specific Consensus State 

Table I shows the theoretical and empirical probability of consensus over a rooted network (figure 4a), K4 (figure 4b), and a 
random network (figure 4c). The column header State refers to the state of each node in the network such, going left to right, 
that digit / represents the value of the /th node; x, = c is the theoretically expected probability that the network will reach a 
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TABLE I. Theoretical and empirical consensus probabilities for a rooted network, Kt, and a random network. 



(a) Rooted Network (Fig. 4a) 1 1 (b) K4 (Fig. 4b) 1 1 (c) Random Network (Fig. 4c) 





Theoretical 


Empirical 


Error 


Tiieoretical 


Empirical 


Error 


Theoretical 


Empirical 


EiTor 


State 


X, = 1 


X, = 2 


X, = 1 


X, = 2 


En"or 


X, = 1 


X, =2 


X, = 1 


X, = 2 


EiTor 


X, = 1 


X, =2 


x„ = 1 


x<, = 2 


Error 


1112 


1.00 


0.00 


1.00 


0.00 


0.00 


0.75 


0.25 


0.74 


0.26 


0.01 


0.83 


0.17 


0.80 


0.20 


0.03 


1121 


1.00 


0.00 


1.00 


0.00 


0.00 


0.75 


0.25 


0.74 


0.26 


0.01 


0.80 


0.20 


0.81 


0.19 


0.01 


1 122 


1.00 


0.00 


1.00 


0.00 


0.00 


0.50 


0.50 


0.50 


0.50 


0.00 


0.63 


0.37 


0.67 


0.34 


0.04 


1211 


1.00 


0.00 


1.00 


0.00 


0.00 


0.75 


0.25 


0.75 


0.25 


0.00 


0.66 


0.34 


0.68 


0.32 


0.02 


1212 


1.00 


0.00 


1.00 


0.00 


0.00 


0.50 


0.50 


0.52 


0.48 


0.02 


0.49 


0.51 


0.47 


0.53 


0.02 


1221 


1.00 


0.00 


1.00 


0.00 


0.00 


0.50 


0.50 


0.52 


0.48 


0.02 


0.46 


0.54 


0.50 


0.50 


0.04 


1222 


1.00 


0.00 


1.00 


0.00 


0.00 


0.25 


0.75 


0.26 


0.74 


0.01 


0.29 


0.71 


0.30 


0.70 


0.01 


2111 


0.00 


1.00 


0.00 


1.00 


0.00 


0.75 


0.25 


0.75 


0.25 


0.00 


0.71 


0.29 


0.68 


0.32 


0.03 


2112 


0.00 


1.00 


0.00 


1.00 


0.00 


0.50 


0.50 


0.52 


0.48 


0.02 


0.54 


0.46 


0.51 


0.49 


0.03 


2121 


0.00 


1. 00 


0.00 


1.00 


0.00 


0.50 


0.50 


0.50 


0.50 


0.00 


0.51 


0.49 


0.52 


0.49 


0.01 


2122 


0.00 


1. 00 


0.00 


1.00 


0.00 


0.25 


0.75 


0.26 


0.74 


0.01 


0.34 


0.66 


0.33 


0.67 


0.01 


2211 


0.00 


1.00 


0.00 


1.00 


0.00 


0.50 


0.50 


0.50 


0.50 


0.00 


0.37 


0.63 


0.34 


0.66 


0.03 


2212 


0.00 


1. 00 


0.00 


1.00 


0.00 


0.25 


0.75 


0.25 


0.75 


0.00 


0.20 


0.80 


0.19 


0.81 


0.01 


2221 


0.00 


1. 00 


0.00 


1.00 


0.00 


0.25 


0.75 


0.23 


0.77 


0.02 


0.17 


0.83 


0.19 


0.81 


0.02 



consensus on state c; and Xg = c is the empirical probability that the network will reach a consensus on state c, as determined 
by aggregating the simulation data. The Error column displays the absolute error between the theoretical and empirical values. 

The primary observation to be made from table 1 is the difference between every theoretical value and its corresponding 
empirical value is less than 0.05. Thus, as per our criteria for validity, we claim that our theory correctly estimates the behavior 
of unconstrained gossip algorithms when conflict resolution is handled via proportional selection. 

Additional observations provide insight into how the topology of a network affects the probability to reach a consensus on a 
specific state. On the rooted network depicted in figure 4a, the state of node 1 (the left most node) determines the consensus 
of the system. For example, the network represented by state 1212 converges to state 1111; but if the network is initialized 
according to state 2211 it will converge to state 2222. On K4 (figure 4b), the probability to reach a consensus on a particular 
state appears to be related to the ratio of the individual node states in the initial state distribution. For example, the initial state 
1121 has a 75% chance reach a consensus on state 1 and a 25% chance to reach a consensus on state 2. This behavior appears 
to be unique to the proportional selection algorithm acting on a completely connected network and most likely arises as a result 
of the uniform initialization of states, the uniform selection of states during conflict resolution, and the uniform distribution of 
transmission probabilities due to the completely connected topology of the underlying network. Data obtained from gossip over 
a simple random network (figure 4c provides insight into the effects of gossiping over asymmetric and non-rooted networks. 
These results illustrate that some form of computation or analysis is required to determine the consensus probabilities for all but 
the most trivial networks. 

C. Time Required to Reach a Consensus 

Table 11 shows the theoretical and empirical values for the average time required to reach a consensus over a rooted network 
(figure 4a), K4 (figure 4b), and a random network (figure 4c). The column header E[t] is the theoretically expected time till 
a consensus state is reached by the network; jXt is the mean time until a consensus is reached as determined by simulation; 
95%CIpL, is the 95% confidence interval of /!,; p-value is the p-value for Student's T Test between the mean time of the empirical 
data and the expected consensus time. If the p-value is greater than 0.05 then the empirical consensus time is statistically equal 
to the theoretical consensus time. 

The primary observation to be made from table II is that there is inconsistency in the statistical equality between the empirical 
consensus time and the theoretical consensus time. In the case of the rooted network, the empirical time is statistically less than 
the theoretical time; indicating that our simulation converges to a consensus faster than expected value. In the case of the A4 
network, the empirical time is statistically greater than the theoretical time; indicating that our simulation converges slower than 
the expected value. Finally, in the case of the random network, the empirical and theoretical times are statistically equal for 
some initial states and not for others. These results appear to be caused by our use of pseudorandom numbers in the simulation. 
Because the networks tested are small (only 4 nodes), the distribution of node states is not very "random". As the number of 
nodes in the network increases, however, the mean empirical consensus time approaches the theoretical expected consensus time 
and the p-values increase in response. An example of this behavior can be observed in table III, where each column represents the 
95% confidence interval for the mean percentage of initial states that converge to a consensus with a mean empirical speed that 
is statistically equal to the expected theoretical consensus time. These values are based on 1000 replications of the simulation. 
Based on our findings, we assert that our theory is valid with the footnote that expected times should only be taken as a rough 
approximation in the case of very small networks. 

Further observation of the data in table II reveals that across all networks examined, multiple states within each sample network 
require the same amount of time to reach a consensus. For example, under the rooted network depicted in figure 4a the states 
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TABLE II. Theoretical and empirical average consensus times for a rooted network, K4, and a random network. 



(a) Rooted Network (Fig. 4a) || (b) (Fig. 4b) || (c) Random Network (Fig. 4c) 





Theoretical 




Empirical 




Theoretical 




Empirical 




Theoretical 




Empirical 




State 


£[,] 




95%C/M, 


p- value 


£[,] 






p-value 


1 £[,] 




95%C/M, 


p-value 


1112 


3.99 


2.64 


(Z.4U, Z.oo) 


0.00 


6. 13 


6.66 


/A 1 Q 1 } A\ 
(o.lo, /.14) 


0.03 


5.21 


5.05 


(A f^A ^ A^\ 
(4. DO, J.4j) 


0.43 


1121 


3.99 


2.52 


/tin 

(z.jU, Z. /4) 


0.00 


6. 13 


6.74 


(D.Z/, /.ZU) 


0.01 


5.82 


4.70 


(4. J J, J.U /) 


0.00 




D.Z / 






u.uu 


7.73 


Q 'iA 


/Q Ql OC"i 

tfi.oj, y.oj ) 


u.uu 


7.75 


7. 15 


{o.Ji, I.Jo) 


U.Ul 


1211 


5.98 


3.52 


(3.27, 3.77) 


0.00 


6.13 


6.74 


(6.26, 7.21) 


0.01 


7.33 


6.43 


(6.04, 6.83) 


0.00 


1212 


7.97 


5.49 


(5.23, 5.75) 


0.00 


7.73 


9.01 


(8.51, 9.51) 


0.00 


8.31 


7.62 


(7.21, 8.03) 


0.00 


1221 


7.96 


5.65 


(5.39, 5.90) 


0.00 


7.73 


8.55 


(8.10, 9.00) 


0.00 


8.13 


7.56 


(7.13, 7.99) 


0.01 


1222 


9.56 


6.58 


(6.33, 6.83) 


0.00 


6.13 


7.02 


(6.52, 7.52) 


0.00 


7.14 


6.84 


(6.42, 7.25) 


0.15 


2111 


9.56 


6.56 


(6.32, 6.81) 


0.00 


6.13 


6.86 


(6.36, 7.37) 


0.00 


7.14 


6.87 


(6.45, 7.29) 


0.21 


2112 


7.96 


5.57 


(5.30, 5.84) 


0.00 


7.73 


8.88 


(8.40, 9.36) 


0.00 


8.13 


7.13 


(6.73, 7.53) 


0.00 


2121 


7.97 


5.33 


(5.07, 5.59) 


0.00 


7.73 


8.91 


(8.39, 9.43) 


0.00 


8.31 


7.91 


(7.44, 8.37) 


0.09 


2122 


5.98 


3.61 


(3.37, 3.85) 


0.00 


6.13 


7.32 


(6.77, 7.87) 


0.00 


7.33 


6.64 


(6.24, 7.05) 


0.00 


2211 


6.27 


4.51 


(4.26, 4.76) 


0.00 


7.73 


9.18 


(8.66, 9.69) 


0.00 


7.75 


7.29 


(6.87, 7.70) 


0.03 


2212 


3.99 


2.83 


(2.59, 3.08) 


0.00 


6.13 


7.22 


(6.73, 7.72) 


0.00 


5.82 


4.79 


(4.37, 5.20) 


0.00 


2221 


3.99 


2.58 


(2.36, 2.80) 


0.00 


6.13 


6.65 


(6.19. 7.11) 


0.03 


5.21 


5.17 


(4.77, 5.58) 


0.85 



TABLE III. Growth in the statistical equality of consensus times over complete networks with proportional selection. 



4 nodes 


5 nodes 


6 nodes 




(0.00, 5.00)% 


(56.54, 90.13)% 


(82.75, 97.8? 


))% 



1112, 1121, 2212, and 2221 all have the same expected consensus time. This observation suggests that the initial state of a 
network can have a serious impact on the time required to reach a consensus and begs the question, "what do these initial states 
have in common?" We introduce a notion of distance between initial states and consensus in order to quantify the commonalities 
between initial states and provide one answer this question. 

D. Distance to Consensus 

We have shown that when information is exchanged over a network through unconstrained gossip it is possible that multiple 
initial states may result in the same level of performance. To explain why this might be the case, we postulate that states with 
similar consensus times are also of a similar distance to consensus. In this context, "distance to consensus" refers to the distance 
from a specific initial network state to any consensus state. 

Definition 1: Given the network G = {V,E), let /i e // be a distribution of node states in the network and c be a specific 
consensus state of the network. Both h and c are vectors with the ith element representing the state of the ith node in the 
network. We define the distance between h and c as 

|v| 

D//(/i,c) = ^5"'(/!,,c,) where 5"^(/z,c) = 
(=1 

It is no coincidence that 5h is essentially the hamming distance. 

Definition 2: Given the network G = {V,E), let h E H be the initial distribution of node states in the network and C be the 
set of all possible consensus states; e.g. Ci G C = [1 1 • • • 1]. Furthermore, let P{c\h) be the probability that the network 
reaches a consensus on state c given that it initialized in state h. We define the expected distance between h and C to be 

D(/i,C) = ^P(c|/i)*5//(/i,c) 

cec 

The expected distance between h and C is the "distance to consensus" from h. 

It is important to note that our definition of distance accounts for neither the number of possible node states nor the number of 
nodes in the network. As a consequence, we are able to compare distances across networks of varying topologies. For example, 
consider the two completely connected networks K4 and K5 that each appear to be one step a particular consensus state. Is the 
distance from state h—[l 1 2 1] to consensus equal to the distance from h' = [l 1 1 2 1]? It turns out that the answer 
is no, they are not exactly equal. The state h is 1.5 units from consensus, while the state h' is 1.6 units because while both 
states are a single step from a consensus on state 1, K4 is three steps from a consensus on state 2 while is 4 steps from a 
consensus on state 2. This result poses yet another question, "when are two states close to one another in terms of the distance 
to consensus?" To answer this question, we offer up the following definition. 



fl, if/i^^c 
1 0, if /i c 
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Definition 3: Two states, hi and h2, are "close" to one another in terms of their distance to consensus if round{h\) — round{h2); 
where round{.) — is the elementary rounding function that rounds a real number to the nearest integer 

This notion of two states being close to one another in terms of their distance to consensus is especially useful in the empirical 
investigation of various network properties on the time required to reach consensus. As we shall see, because the consensus time 
differs depending on the initial state, and the initial states can be partitioned based on their distance from consensus, we are able 
to reduce error by conducting analysis on the partitions of states as opposed to aggregating effects over all initial states. 

Furthermore, the ability to classify performance based on the distance to consensus implies that if the performance of one initial 
state in a class is known, the performance of all other states in that class will be similar. The exact computational requirements 
needed to classify each state will differ depending on the underlying network and the conflict resolution algorithm, but in the 
case of certain configurations, such as a completely connected network with proportional selection, it is trivial to determine if 
two states are in the same class**. 

VII. Influences on Consensus Time: The Effect of States, Nodes, and Density 

When applying absorbing Markov chains to analyze the behavior of unconstrained gossip algorithms, it becomes clear that 
the number of nodes in the network, the number of states each node can represent, the topology of the network, and the conflict 
resolution algorithm should all be determining factors in the length of time required to reach consensus. The number of nodes 
and and node states determine the size of the Markov chain. The network topology and conflict resolution algorithm determine 
the transition probabilities. 

Now that we have shown our analytical framework to be valid, we briefly explore the impact of the Markov chain state space 
size and communication network density on the consensus time of small networks operating under unconstrained gossip with 
proportional selection. 

A. Expectations 

It is our expectation that under proportional selection the time required to reach consensus will increase as the the number of 
nodes in the communication network and the number of possible states that each node can assume increases - regardless of the 
underlying topology. This expectation is based on the growth of the state space for the associated Markov chain. 

We also expect that, in general, the time required to reach a consensus will not necessarily decrease as the density of the 
underlying network increases. This expectation is due largely to the random nature of gossiping. Dense networks impose a larger 
number of choices for each node to make on average. In the worst case, this can result in an increase in the consensus time due 
to poor choice of transmission paths. In the best case, the optimal consensus sequence can be selected and result in the shortest 
consensus time. 

B. The Impact of Nodes and Node States on Consensus Time 

Given a network of n nodes with k possible states per node, each network state in the Markov chain can be represented as a 
vector of n elements with each element taking on a value between 1 and k, inclusive. Under this representation adding one more 
node to the network is equivalent to increasing the total number of states in the Markov chain from k" to and adding one 
more possible node state is equivalent to increasing the total number of states in the Markov chain from k" to {k + 1)". Thus, it 
is reasonable to expect that as the number of nodes in a network and the number of possible states per node grow larger, adding 
one more node will produce many more Markov states than adding one more possible node state. This increase in the size of 
the Markov chain should have a direct impact on the time required to reach a consensus, with larger chains requiring more time 
as a result of the possible states that a network can end up in. Additionally, because the initial states of the network can be 
partitioned by their distance from consensus, we expect that the impact of adding nodes and node states will be more prevalent 
when the initial state of the network is far from consensus. 

Examples of the consensus time behavior under bidirectional completely connected (figures 5a, 5b, and 5c), star (figures 5d, 
5e, and 5f), and ring (figures 5g, 5h, and 5i) networks are displayed in tables IV, V, and VI. These network structures were 
chosen for exploration over random and complex networks because they're easier to uniformly scale. Each table shows the 
95% confidence interval of the mean consensus time for all initial states at a particular distance from consensus at the specific 
noded/states configuration^^. An increase in the number of nodes corresponds to moving down a column, from top to bottom. An 
increase in the number of node states corresponds to moving across a row, from left to right. A value of "-" indicates that there 
was no data for the corresponding network configuration. This partitioning is important because, as we will see, the further the 
network starts from consensus the larger the impact of our measured variables. 

Table IV displays the data for initial network states that start at a distance of 1 unit from consensus. It can be observed that 



'^In the case of a completely connected network with proportional selection, the probability to converge to a particular consensus state is proportionally 
determined by the initial state (see table I). Dh is always straightforward to compute. 

'This means we averaged the consensus time for all states d units from consensus, as d ranges from 1 to the maximum observed distance. 
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(g) (h) (i) 

Fig. 5. The complete (a, b, c), star (d, e, f), and ring (g, h, i) networks used to explore tlie impact of node and node state quantities on the consensus time of 
unconstrained gossip with proportional selection. 

TABLE IV. Theoretically determined 95% confidence intervals for the mean consensus times at distance = I. 







2 states 


3 states 


4 states 


K„ 


3 nodes 

4 nodes 

5 nodes 


{5.5, 5.5) 
(6.13, 6.13) 
(7.59, 7.59) 


(5.5, 5.5) 
(6.13, 6.13) 
(7.59, 7.59) 


(5.5, 5.5) 
(6.13, 6.13) 
(7.59, 7.59) 


Stai- 


3 nodes 

4 nodes 

5 nodes 


(4.79, 5.88) 
(7.11, 7.89) 
(10.01, 10.61) 


(5.09, 5.57) 
(7.31, 7.69) 
(10.16, 10.46) 


(5.18, 5.49) 
(7.38, 7.62) 
(10.21, 10.41) 


Ring 


3 nodes 
3 nodes 
3 nodes 


(5.50, 5.50) 
(5.89, 5.89) 
(7.31, 7.31) 


(5.50, 5.50) 
(5.89, 5.89) 
(7.31, 7.31) 


(5.50, 5.50) 
(5.89, 5.89) 
(7.31, 7.31) 



increasing the number of nodes in all networks results in a significant increase in the consensus time; however, increasing the 
number of states per node does not correspond to a significant increase in the consensus time. The response to increasing the 
number of node states is most likely occurs because at a distance of 1, the network is already close to consensus; typically with 
only a single node out of sync. Because so few nodes require a state change, the number of states will have little to no impact 
on the time required to reach a consensus. 

Table V displays the data for initial network states that start at a distance of 2 unit from consensus. It can be observed that 
increasing the number of nodes in all networks results in a significant increase in the consensus time; increasing the number of 
states per node results in a small, and questionably significant increase in the consensus time. The impact on the consensus time 
as a result of increasing the number of node states appears to grow in significance with the number of nodes. If the observed 
trends continue, then the impact should be clearly significant for large networks. This is in line with our expectations based on 
the growth of the Markov chain state space. 

Table VI displays the data for initial network states that start at a distance of 3 unit from consensus. As with tables IV and V 
an increase in the number of nodes corresponds to a significant increase in the consensus time. Unlike tables IV and V, however, 
an increase in the number of states per node also results in a significant increase in the consensus time. This observation supports 
our expectation that the distance between an initial state and consensus is important. 

Overall, these observations paint an interesting picture of the dynamics behind the temporal behavior of unconstrained gossip. 
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TABLE V. Theoretically determined 95% confidence intervals for the mean consensus times at distance = 2 







2 states 


3 states 


4 states 




3 nodes 

4 nodes 

5 nodes 


(7.73, 7.73) 
(10.56, 10.56) 


(7.33, 7.33) 
(8.34, 8.63) 
(10.9349, 11.1012) 


(7.33, 7.33) 
(8.57, 8.70) 
(11.13, 11.22) 


Stai- 


3 nodes 

4 nodes 

5 nodes 


(9.50, 9.50) 
(14.37, 1447) 


(7.00, 7.00) 
(10.24, 10.59) 
(14.94, 15.17) 


(7.00, 7.00) 
(10.52, 10.68) 
(15.20, 15.33) 


Ring 


3 nodes 

4 nodes 

5 nodes 


(7.20, 7.66) 
(10.00, 10.25) 


(7.33, 7.33) 
(7.99, 8.27) 
(10.46, 10.63) 


(7.33, 7.33) 
(8.21, 8.34) 
(10.64, 10.73) 



TABLE VI. Theoretically determined 95% confidence intervals for the mean consensus times at distance = 3 



I 2 states 3 states 4 states 



K„ 


3 nodes 

4 nodes 

5 nodes 


(12.70, 12.70) 


(9.99, 9.99) 
(13.03, 13.10) 




3 nodes 






Star 


4 nodes 




(12.25, 12.25) 




5 nodes 


(17.35, 17.36) 


(17.81, 17.91) 


Ring 


3 nodes 
3 nodes 
3 nodes 


(12.07, 12.14) 


(9.54, 9.54) 
(12.41, 12.48) 



Across all three graphs that we examine, increasing the number of nodes in the network results in an increase in the consensus 
time. Increasing the number of states per node, however, has different effects depending on the distance of the initial network 
state. When the initial network state is close to consensus, the adding more possible states per node has little to no impact on 
the consensus time. As the distance between the initial network state and consensus increases, adding additional choices for each 
node states results in a more significant impact on the consensus time. 

In general, our results suggest that the consensus time increases with both the number of nodes in the network and the number 
of possible states per node. They also reflect our expectations of behavior as a result of Markov chain analysis. The Markov state 
space grows faster via the addition of nodes than it does via the addition of states per node. Under this perspective, one possible 
cause for the increase in consensus time is simply that it takes longer to reach a consensus because the state space is larger, and 
thus the likelihood of encountering the optimal consensus sequence decreases and the average path length to a consensus state 
increases. 

C. The Impact of Network Density on Consensus Time 

So far we have established that the consensus time of unconstrained gossip under proportional selection is largely influenced 
by the number of nodes in the underlying communication network. This finding is in line with the existing research on the 
average gossip algorithm and gossip algorithms that do not account for conflict among transmissions. We have also shown that 
the number of states per node can have a significant impact on the consensus time, but that the significance of that impact varies 
with the distance of the initial network state from consensus. To the best of our knowledge, similar results concerning the number 
of states per node have not been documented. 

We now investigate how the density^' of a communication network influences the consensus time. Network density is worth 
investigating because it impacts how easily information can spread between nodes. If a network is too sparse, it may lack a 
directed spanning tree and thus be unable to produce a consensus. As a network grows denser, it is possible for multiple directed 
spanning trees to exist, thereby providing multiple options for consensus formation, with some consensus sequences requiring 
more time than others to complete. Regardless of the density, if the network is well connected (see theorem 3) then there exists 
a shortest consensus sequence. The existence of this shortest consensus sequence is why we do not necessarily think that the 
density of a network matters; no matter how many edges get added to a network the consensus sequences they produce can be 
no better than the shortest and there is no guarantee that they will be used for transmission. In terms of our Markov modeling 
framework, the network density has a direct impact on the transition probabilities of the Markov chain. An increase in density 
has the potential to dilute the probability of transmitting along the shortest consensus sequence, but it does not nullify it. 



'We use the standard definition of networit density as the ratio of existing edges to total possible edges. 
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Network Density vs. Consensus Time 



E 
i- 

VI 

c 

OJ 

c 
o 

o 



0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 
Density 

Fig. 6. The consensus time as the density of an arbitrary network with 5 nodes and 3 states per node increases from 0.6 to 1.0 in increments of 0.05. Each 
line represents a different initial distribution of node states; the exact state of each line is irrelevant. Each bar represents the 95% confidence interval of the 
associated data point. 

We conduct this investigation by constructing the completely connected five node network with 3 possible node states and 
then removing edges at random until a desired density is reached. To account for the multiple configurations that can occur when 
edges are removed, we average the consensus time of 30 graphs at each density value. 

Figure 6 shows the results of our investigation as a plot of the consensus time as the density of an arbitrary network increases 
from 0.6 to 1.0 in increments of 0.05. Each line represents a different initial distribution of node states. The bars on each 
line bound the 95% confidence interval of the mean consensus time at the corresponding density value. There are two primary 
observations that can be made from this data. The first is that the initial states cluster together into partitions. This is the same 
behavior that was observed during our validation experiments; see table I and 11. Not only can the initial network states be 
partitioned, but these partitions become clearer as the network becomes denser. At a density value of 0.6 the boundaries between 
three of the four partitions are fuzzy, but once the density of the network reaches 0.7 it is clear which initial states belong to 
each partition. The second observation is that regardless of the initial state, there does not appear to be a significant statistical 
increase in the consensus time as the density of the network increases, but the slight upward trend suggests that as the density 
increases the system may be more likely to perform at the slower end of the performance spectrum; as evidenced by the increase 
in the lower bound of the confidence intervals while the upper bounds remain fairly fixed. For instance, at a density of 0.6 on 
the examined network an initial state one unit away from consensus it should take, on average, between 6.5 and 8 steps for a 
consensus to be formed; however, when the density increases to 0.95 is should take an average of 7.5 steps to form a consensus. 

As previously mentioned, this observed behavior may be due to the additional directed spanning trees that appear as a network 
becomes more connected. As more viable paths to consensus become available it is feasible that the probability of moving 
along the shortest path decreases, and as a result longer transmission routes are more likely to be selected, thus resulting in the 
possibility (but not guarantee) of an increased time to consensus. 

VIII. Conclusions 

Gossip algorithms are widely used to solve the distributed consensus problem on networks, but issues can arise when nodes 
receive multiple signals either at the same time or before they are able to finish processing their current work load. In the 
real world, it can often be hard to limit the amount of information a node is exposed to, especially as networks become larger 
and their nodes more complex. To address these issues, we introduce the notion of conflict resolution for unconstrained gossip 
algorithms and prove that their application leads to a valid consensus state when the underlying communication network possesses 
certain properties. We also introduce a methodology that is based on absorbing Markov chains for analyzing unconstrained gossip 
algorithms that makes use of these conflict resolution algorithms. This technique allows us to calculate both the probabilities of 
converging to a specific consensus state and the time such convergence is expected to take. Finally, we make use of simulation 
experiments to verify and supplement our theory with additional results. 

We show that the number of nodes in a network, the initial state of the network, and specific network topology are all 
critical factors in determining how long it will take for consensus formation. The number of possible states that each node can 
assume has much less impact on system performance than a increase in the number of nodes. Furthermore, the significance of 
this impact varies with the initial distribution of node states. This finding has important implications for deriving bounds on 
consensus time for the behavior of unconstrained gossip algorithms without first requiring computation of the Markov transition 
matrix. Specifically, it suggests that it might be possible to leverage existing techniques from the existing body of research on 
gossip algorithms, as well as techniques from the opinion dynamics and voter model literature in order to study unconstrained 
gossip when assumptions are made on the initial state of the network. We think that such a bridge is not only a good idea, but 
essential for future advancements in distributed problem solving. 
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